1 Towards on-line analytical mining in large databases
Jiawei Han 

March 1998 ACM SIGMOD Record, Volume 27 issue 1

Full text available: pdf(387.04 KB) Additional Information: full citation, abstract, citings, index terms

Great efforts have been paid in the Intelligent Database Systems Research Lab for the 
research and development of efficient data mining methods and construction of on-line 
analytical data mining systems. Our work has been focused on the integration of data 
mining and OLAP technologies and the development of scalable, integrated, and multiple 
data mining functions. A data mining system, DBMiner, has been developed for interactive
mining of multiple-level knowledge in large relational databases and ... 

2 Discovering internet marketing intelligence through online analytical web usage mining
Alex G. Büchner, Maurice D. Mulvenna

December 1998 ACM SIGMOD Record, Volume 27 issue 4

Full text available: pdf Additional Information: full citation, abstract, citings, index terms

This article describes a novel way of combining data mining techniques on Internet data in 
order to discover actionable marketing intelligence in electronic commerce scenarios. The 
data that is considered not only covers various types of server and web meta information, 
but also marketing data and knowledge. Furthermore, heterogeneity resolution thereof and 
Internet- and electronic commerce-specific pre-processing activities are embedded. A 
generic web log data hypercube is formally defined ... 

3 Web mining for web personalization 
Magdalini Eirinaki, Michalis Vazirgiannis

February 2003 ACM Transactions on Internet Technology (TOIT), Volume 3 issue 1

Full text available: pdf Additional Information: full citation, abstract, references, citings, index terms, review



Web personalization is the process of customizing a Web site to the needs of specific users, 
taking advantage of the knowledge acquired from the analysis of the user's navigational 
behavior (usage data) in correlation with other information collected in the Web context, 
namely, structure, content, and user profile data. Due to the explosive growth of the Web, 
the domain of Web personalization has gained great momentum both in the research and 
commercial areas. In this article we present a survey ... 



Keywords: WWW, Web personalization, Web usage mining, user profiling 
4 Using a starfield visualization for analyzing product performance of online stores
Juhnyoung Lee, Mark Podlaseck

October 2000 Proceedings of the 2nd ACM conference on Electronic commerce 

Full text available: pdf(353.32 KB) Additional Information: full citation, references, index terms



Keywords: electronic commerce, marketing, merchandising, visualization 



5 Automatic personalization based on Web usage mining
Bamshad Mobasher, Robert Cooley, Jaideep Srivastava
August 2000 Communications of the ACM, volume 43 issue 8

Full text available: pdf Additional Information: full citation, references, citings, index terms



6 Web usage mining for Web site evaluation
Myra Spiliopoulou

August 2000 Communications of the ACM, Volume 43 issue 8

Full text available: pdf, html(45.37 KB) Additional Information: full citation, references, citings, index terms



7 Survey articles: Web usage mining: discovery and applications of usage patterns from
Web data

Jaideep Srivastava, Robert Cooley, Mukund Deshpande, Pang-Ning Tan 
January 2000 ACM SIGKDD Explorations Newsletter, Volume 1 issue 2 

Full text available: pdf Additional Information: full citation, abstract, references, citings

Web usage mining is the application of data mining techniques to discover usage patterns 
from Web data, in order to understand and better serve the needs of Web-based 
applications. Web usage mining consists of three phases, namely preprocessing, pattern
discovery, and pattern analysis. This paper describes each of these phases in detail. Given 
its application potential, Web usage mining has seen a rapid increase in interest, from both 
the research and practice communities. This pap ... 



Keywords: data mining, web usage mining, world wide web 
Ron Kohavi, Neal J. Rothleder, Evangelos Simoudis

August 2002 Communications of the ACM, volume 45 issue 8 

Full text available: pdf(104.67 KB), html Additional Information: full citation, abstract, references, index terms

The goal is business effectiveness through 'verticalization,' usability, and integration with
operational systems. 



9 Mining multimedia data

Osmar R. Zaïane, Jiawei Han, Ze-Nian Li, Jean Hou

November 1998 Proceedings of the 1998 conference of the Centre for Advanced Studies 
on Collaborative research 

Full text available: pdf(377.84 KB) Additional Information: full citation, abstract, references, citings, index terms

Data Mining is a young but flourishing field. Many algorithms and applications exist to mine 
different types of data and extract different types of knowledge. Mining multimedia data is, 
however, at an experimental stage. We have implemented a prototype for mining high-level 
multimedia information and knowledge from large multimedia databases. MultiMedia Miner
has been designed based on our years of experience in the research and development of a 
relational data mining system, DBMiner, in the Inte ... 

Keywords: data cube, data mining, data warehousing, image analysis, information 
retrieval, multimedia, world-wide web 



10 Effective data mining ...

Khalil M. Ahmed, Nagwa M. El-Makky, Yousry Taha

November 1998 Proceedings of the 1998 conference of the Centre for Advanced Studies 
on Collaborative research 

Full text available: pdf(292.82 KB) Additional Information: full citation, abstract, references, index terms

An effective Data Mining (DM) system for mining multiple-level knowledge from Data
Warehouse (DW), DB and flat files of raw data is proposed. The DW represents the
backbone of the proposed architecture. Intermediate, as well as final results of mining are
incorporated into the DW for efficient processing of further queries. A Markov Chain
mathematical model is developed for managing data dependency and consistency in the
DW. An adaptive hybrid view technique is introd ...

11 Analysis of navigation behaviour in web sites integrating multiple information systems
Bettina Berendt, Myra Spiliopoulou 

March 2000 The VLDB Journal — The International Journal on Very Large Data Bases, 

Volume 9 Issue 1 

Full text available: pdf Additional Information: full citation, abstract, citings, index terms

The analysis of web usage has mostly focused on sites composed of conventional static
pages. However, huge amounts of information available in the web come from databases or 
other data collections and are presented to the users in the form of dynamically generated 
pages. The query interfaces of such sites allow the specification of many search criteria. 
Their generated results support navigation to pages of results combining cross-linked data 
from many sources. For the analysis of visitor naviga ... 

Keywords: Conceptual hierarchies, Data mining, Query capabilities, Web databases, Web 
query interfaces, Web usage mining 



12 Web mining and its SQL based parallel execution
Masaru Kitsuregawa, Takahiko Shintani, Iko Pramudiono 

January 2001 Australian Computer Science Communications, Proceedings of the
workshop on Information technology for virtual enterprises ITVE '01, volume 23 issue 6

Full text available: pdf(674.03 KB) Additional Information: full citation, abstract, references

Web mining can be classified into two categories, Web access log mining and Web structure
mining. We performed association rule mining and sequence pattern mining against the 
access log which was accumulated at NTT Software Mobile Info Search portal site. Detail 
web log mining process and the rules we derived are reported in this paper. The parallel 
association rule mining is explored on large scale PC cluster system. Parallelism is key to 
improve the performance. We achieved substantial speed u ... 

13 Technical opinion: Component-based data mining frameworks
Fernando Berzal, Ignacio Blanco, Juan-Carlos Cubero, Nicolas Marin 
December 2002 Communications of the ACM, volume 45 issue 12 

Full text available: pdf, html(18.89 KB) Additional Information: full citation, abstract, references, index terms
Robert L. Grossman, Mark F. Hornick, Gregor Meyer
August 2002 Communications of the ACM, volume 45 issue 8 

Full text available: pdf(77.42 KB), html(17.27 KB) Additional Information: full citation, abstract, references, citings, index terms

Lacking standards for statistical and data mining models, applications cannot leverage the 
benefits of data mining. 

15 The use of web structure and content to identify subjectively interesting web usage
patterns
Robert Cooley 

May 2003 ACM Transactions on Internet Technology (TOIT), Volume 3 issue 2 

Full text available: pdf Additional Information: full citation, abstract, references, index terms

The discipline of Web Usage Mining has grown rapidly in the past few years, despite the 
crash of the e-commerce boom of the late 1990s. Web Usage Mining is the application of 
data mining techniques to Web clickstream data in order to extract usage patterns. Yet, 
with all of the resources put into the problem, claims of success have been limited and are 
often tied to specific Web site properties that are not found in general. One reason for the 
limited success has been a component of Web Usage Mi ... 

Keywords: Data mining, Web usage mining, World Wide Web 



16 Web mining: A framework for web table mining
Yingchen Yang, Wo-Shun Luk 

November 2002 Proceedings of the 4th international workshop on Web information and 
data management 

Full text available: pdf(125.03 KB) Additional Information: full citation, abstract, references, index terms

Web table mining is about information extraction from tables published inside web pages as 
HTML texts. Most previous work on this subject makes use of the tags to discover 
components of the table. Our work treats web as a distinct publication media, in two ways. 
We argue that new types of table format have been developed specially for the web. We 
also argue that the visual cues embedded within the HTML text, are utilized by the authors 
to direct the viewer on how to read the contents contained a ... 

Keywords: data extraction, information extraction, table mining, web pages 



17 Searching for dependencies at multiple abstraction levels

Toon Calders, Raymond T. Ng, Jef Wijsen 

September 2002 ACM Transactions on Database Systems (TODS), Volume 27 Issue 3 

Full text available: pdf(411.24 KB) Additional Information: full citation, abstract, references, citings, index terms

The notion of roll-up dependency (RUD) extends functional dependencies with 
generalization hierarchies. RUDs can be applied in OLAP and database design. The problem 
of discovering RUDs in large databases is at the center of this paper. An algorithm is 
provided that relies on a number of theoretical results. The algorithm has been 
implemented; results on two real-life datasets are given. The extension of functional 
dependency (FD) with roll-ups turns out to capture meaningful rules that are outsi ... 

Keywords: Data mining, functional dependencies, knowledge discovery, online analytical 
processing 
18 A methodology for workload characterization of E-commerce sites
Daniel A. Menascé, Virgilio A. F. Almeida, Rodrigo Fonseca, Marco A. Mendes
November 1999 Proceedings of the 1st ACM conference on Electronic commerce 

Full text available: pdf Additional Information: full citation, references, citings, index terms



19 Knowledge discovery in data warehouses
Themistoklis Palpanas

September 2000 ACM SIGMOD Record, volume 29 issue 3 

Full text available: pdf(240.77 KB) Additional Information: full citation, abstract, citings, index terms

As the size of data warehouses increase to several hundreds of gigabytes or terabytes, the
need for methods and tools that will automate the process of knowledge extraction, or
guide the user to subsets of the dataset that are of particular interest, is becoming 
prominent. In this survey paper we explore the problem of identifying and extracting 
interesting knowledge from large collections of data residing in data warehouses, by using 
data mining techniques. Such techniques have the ability to i ... 
Jesper Andersen, Anders Giversen, Allan H. Jensen, Rune S. Larsen, Torben Bach Pedersen,
Janne Skyt 

November 2000 Proceedings of the 3rd ACM international workshop on Data 
warehousing and OLAP 

Full text available: pdf Additional Information: full citation, references, citings